Automatic Indexing of Handwritten Medical Forms for Search Engines

Authors

  • Robert Milewski
  • Venu Govindaraju
Abstract

A new paradigm is presented that models the relationships between handwriting and topic categories (denoted as ‘concepts’) in the context of medical forms. The ultimate goals are (i) the recognition of medical handwriting, and (ii) the use of such information for a medical form search engine. Medical forms have large, diverse and complex lexicons drawn from English, medical and pharmacology corpora. The technique shows that the output of a handwriting recognition engine, even with just a few recognized characters, can be used to represent a medical concept. This allows (i) a reduced lexicon to be constructed, thereby improving the performance of handwriting recognition engines [6][21], and (ii) unseen PCR forms to be tagged with a concept and later searched. Both practical and theoretical results are reported. This research develops the notion of a ‘computational semantic lexicon’, which was only loosely introduced in our IWFHR 2002 paper [15], and incorporates other research in the area of call-routing [2][3].
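The idea of using a few recognized characters to select a concept, and then a reduced lexicon, can be illustrated with a minimal sketch. This is not the authors' method: the concept names, the tiny lexicons, the `_` wildcard notation for unrecognized characters, and the simple match-count scoring are all assumptions made only for illustration.

```python
import re

# Hypothetical concept-specific lexicons (assumed for illustration only;
# the lexicons used in the paper are not given in this abstract).
CONCEPT_LEXICONS = {
    "cardiac": {"chest", "pain", "angina", "aspirin", "nitroglycerin"},
    "trauma": {"fracture", "laceration", "splint", "bleeding", "pain"},
    "respiratory": {"asthma", "wheezing", "albuterol", "oxygen", "dyspnea"},
}


def pattern_to_regex(partial):
    """Convert a partial recognition such as 'a_p_r_n' into a regex.

    '_' stands for a character the recognizer could not read
    (an assumed notation, not one taken from the paper).
    """
    return re.compile("".join("." if ch == "_" else re.escape(ch) for ch in partial))


def score_concepts(partials, lexicons):
    """Count, per concept, how many partial words match some lexicon entry."""
    scores = {}
    for concept, words in lexicons.items():
        count = 0
        for partial in partials:
            rx = pattern_to_regex(partial)
            if any(rx.fullmatch(word) for word in words):
                count += 1
        scores[concept] = count
    return scores


def reduced_lexicon(partials, lexicons):
    """Pick the best-scoring concept and return it with its (smaller) lexicon."""
    scores = score_concepts(partials, lexicons)
    best = max(scores, key=scores.get)
    return best, lexicons[best]


if __name__ == "__main__":
    concept, lexicon = reduced_lexicon(["a_p_r_n", "c_e_t", "_ai_"], CONCEPT_LEXICONS)
    print(concept, sorted(lexicon))
```

In this toy setup the recognizer would then search only the selected concept's lexicon instead of the full vocabulary, and the concept label could serve as a search tag for the form.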

Similar resources

Automatic Evaluation of Persian Web Video Search Engines Based on Vote Aggregation

Today, the growth of the internet and its strong influence on individuals’ lives have led many users to meet their daily needs through search engines; hence, search engines need to be modified and continuously improved. Therefore, evaluating search engines to determine their performance is of paramount importance. In Iran, as in other countries, extensive research is being performed ...

Using a Hidden-Markov Model in Semi-Automatic Indexing of Historical Handwritten Records

Indexing of historical records is a process that uses human effort to read text images and convert them into a machine readable format that facilitates search. The Church of Jesus Christ of Latter-day Saints has been using volunteers to index millions of microfilm images of genealogy records collected throughout the world. This indexing process is time-consuming. We adapt a technique for holist...

Indexing and retrieval of handwritten medical forms

POSTER PAPER. This paper proposes an approach to indexing and retrieving degraded handwritten documents. We present a modified version of the popular Vector Model in information retrieval (IR). Our model incorporates the top n candidates from a HR system into the scheme for calculating the term frequency (tf) and the inverse document frequency (idf). Standardized IR Tests show that the proposed app...

A Search Engine for Handwritten Documents

The design and functionality of a versatile search engine for handwritten documents is described. Documents are indexed using global image features, e.g., stroke width, slant, word gaps, as well as local features that describe shapes of characters and words. Image indexing is done automatically using page analysis, page segmentation, line separation, word segmentation and recognition of characters ...

A Query-by-Similarity Indexing Strategy for Web Forms

Search engines do not provide specific searches for Web forms related to the Deep Web, in particular similarity search. To address this gap, we propose a query-by-similarity system called WF-Sim, and this paper presents the indexing strategy adopted by WF-Sim for query-by-similarity over Web forms. It is centered on index structures suited to the main kinds of queries posed on Web forms, as ...

Publication date: 2006